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Description 



METHOD FOR ESTIMATING A PITCH 
ESTIMATION OF THE SPEECH SIGNALS 

Background of Invention 
[0001] 1. Field of the Invention 

[0002] The present invention relates to a method for estimating a 
pitch estimation, and more specifically, to a method for 
calculating a pitch estimation with the autocorrelation 
method. 

[0003] 2. Description of the Prior Art 

[0004] vvith improvements in the field of electronic wireless com- 
munication and the increase in popularity of multimedia 
systems and the internet, the demand for sound signal 
encoding and analyzing has increased alongside. Sound 
telecommunication is an important application in next 
generation networks and also holds an important role in 
the multimedia telecommunication of the network. 

[0005] Telecommunication is widely applied to the technique of 



sound signal encoding, so specification in telecommuni- 
cation is quite important. There are currently some speci- 
fications in the International Telecommunication Union: 
PCM(64Kpbs),G711(64Kpbs),G726 (ADPCM 
16,24,32,40Kpbs),C728(Low Delay CELP 
16Kpbs),G728(Low Delay CELP 8Kpbs). At the moment, 
the specifications of the cellular mobile telephone systems 
in North American is the VSELP encoding technique of the 
TIA(Telecommunication Industry Association). The specifi- 
cations of the cellular mobile telephone systems in Japan 
and Europe are RPE-LTP encoding technique of the 
JDCQapanese Digital Cellular) and GSM(Global System for 
Mobil Telecommunication). The current encoding tech- 
nique is still at 8 Kbps. The encoding technique of the 
new generation is at 4.8 Kbps (LD-CELP)-2.4 Kbps 
(MELP,STC). For achieving such a ratio, the operation com- 
plexity is also raised. The general digital signal processor 
is used for finishing the immediate operation. 
[0006] For matching the design, there is one or a plurality of dig- 
ital signal processors in the special application design for 
sound compression or sound identification. The features 
of the DSP include a short instruction cycle, high paral- 
lelism and a plurality of special address modes to resolve 



general digital signal processing. The step with large 
amounts of operations in the voice processing is the step 
with the pitch estimation. The step is calculated according 
to equation 1. 

« = 0 

pitch period = {t \ max[ 

equation 1 

[0007] Equation 1 is the operation of the autocorrelation; x[n]is a 
sound signal comprising a plurality of voice data from x[0] 
to x[N-l]. Voice data x[n+T] is a sound signal generated 
according to a sound signal x[n] and lags a lag parameter. 
Sound signal x[n+T] is from x[t] to x[N-1+t]. R[t] is a au- 
tocorrelation value corresponding to a lag parameter. R[t] 
is the value of the amount of the voice data in the sound 
signal x[n] times the corresponding voice data in the 
sound signal x[n+T]. 

[0008] The autocorrelation operation in the method for estimat- 
ing the pitch estimation according to the prior art calcu- 
lates a plurality of autocorrelation value according to each 



lag parameter. Then a plurality of autocorrelation values 
are compared and the maximum autocorrelation value of 
these autocorrelation values is found. The lag parameter 
corresponding to the maximum autocorrelation value is 
used for calculating the pitch estimation. 
[0009] Additionally, normalizing autocorrelation method can also 
be used to estimate the pitch estimation. Please refer to 
equation 2. 

R[Tf = 



N-1 
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equation 2 

[0010] The normalizing autocorrelation method calculates the 
value R[t]^ according to equation 2, i.e. the value R[t]^ is 
calculated according to each lag parameter Tina plurality 
of lag parameters t. The values R[t]^ are stored in a mem- 
ory and compared, so the maximum R[t]^ is found. Then 



the lag parameter t corresponding to the maximum R[t] 
is used for calculating the pitch estimation. 
[0011] The amount of the operation of these two kinds of 

method for estimating pitch estimation in the digital sig- 
nal processor is quite large. When the data bulk of the en- 
try sound data is larger, the time of data processing is 
longer. When the sound signal cannot be operated imme- 
diately, the quality of the sound signal will be lowered. 
Summary of Invention 

[0012] It is therefore a primary objective of the claimed invention 
to provide a method for calculating a pitch estimation with 
the autocorrelation method. 

[0013] According to the claimed invention, the method calculates 
a pitch estimation of a sound signal with a voice proces- 
sor. The sound signal comprises a plurality of sound data. 
The method comprises the following steps: (a) determin- 
ing a pitch upper bound value and a pitch lower bound 
value according to the signal and corresponding pitch 
ranges in a database; (b) calculating a lag parameter up- 
per bound value and a lag parameter lower bound value 
according to the pitch upper bound value and the pitch 
lower bound value determined in step (a); (c)using the 
voice processor to generate a plurality of autocorrelation 



values according to a plurality of pointer values between 
the lag parameter lower bound value and the lag parame- 
ter upper bound value; and (d)comparing the plurality of 
autocorrelation values to find the maximum of the plural- 
ity of autocorrelation values and calculating the pitch esti- 
mation of the sound signal according to the lag parameter 
corresponding to the maximum autocorrelation values. 
[0014] These and other objectives of the claimed invention will 
no doubt become obvious to those of ordinary skill in the 
art after reading the following detailed description of the 
preferred embodiment that is illustrated in the various 
figures and drawings. 
Brief Description of Drawings 

[0015] Fig.l is a block diagram of a voice processing device ac- 
cording to the invention. 

[0016] Fig. 2 is a flowchart of a method for estimating a pitch es- 
timation in the first embodiment according to the inven- 
tion. 

[0017] Fig. 3 is a flowchart of a method for estimating a pitch es- 
timation in the second embodiment according to the in- 
vention. 
Detailed Description 



[0018] Please refer to Fig.l. Fig.l is a block diagram of a voice 
processing device 10 according to tlie invention. A sound 
signal is input in a voice processing device 10. The voice 
processing device 10 comprises a voice processor 12 for 
processing sound signal x[n], a memory 14 for storing a 
plurality of lag parameters and autocorrelation values R[t] 
calculated by the voice processing device 10 and a 
database 18 for storing the sound signal x[n] and corre- 
sponding pitch range. The sound signal x[n] is generated 
by a sound signal generator 16 and input in the voice 
processing device 10. 

[0019] The database 18 is used for storing different sound sig- 
nals and corresponding pitch ranges. When the voice pro- 
cessing device 10 receives a sound signal x[n], the voice 
processor 12 compares the sound signal x[n] and the data 
in the database 18 to analyze which kind of sound signal 
the sound signal x[n] is and calculate the pitch range of 
this kind of sound signal to determined the pitch upper 
bound value 




and the pitch lower bound value 



[0020] Please refer to Fig. 2. Fig. 2 is a flowchart of the method for 
estimating a pitch estimation in the first embodiment ac- 
cording to the invention. The method for estimating the 
pitch estimation in the invention is operated according to 
equation 3. The method comprises the following steps: 

-^[^1 ^ ^^^M^ "'"^] equation 3 



[0021] Step 200: determining a pitch upper bound value 



according to the signal x[n] and corresponding pitch 
ranges in a database 18; 
[0022] Step 202: calculating a lag parameter upper bound value 



wherein n=i X An 



1 = l,2f3f... f ceil ( — ) 




and a pitch lower bound value 




and a lag parameter lower bound value according to the 
pitch upper bound value 



and the pitch lower bound value determined in step 200; 
[0023] Step 204: using the voice processor 12 to generate a plu- 
rality of autocorrelation values R[t] according to a plurality 
of pointer values between the lag parameter lower bound 
value 



[0024] Step 206: comparing the plurality of autocorrelation val- 
ues R[t] to find the maximum of the plurality of autocor- 
relation values R[t] and calculating the pitch estimation of 
the sound signal x[n] according to the lag parameter t 
corresponding to the maximum autocorrelation values 
R[T]. 

[0025] In step 200, the voice processor 12 determines a pitch 
upper bound value 





and the lag parameter upper bound value 




;and 



and a pitch lower bound value 

according to signal x[n] and the corresponding pitch 
ranges in a database 18; 
[0026] In step 202, the voice processor 12 calculates a lag pa- 
rameter upper bound value 

and a lag parameter lower bound value 

A, 

according to the pitch upper bound value 
and the pitch lower bound value 

determined in step 200. The lag parameter upper bound 
value 



is that the sampling frequency divided by the pitch lower 
bound value 

and the lag parameter lower bound value 

is the sampling frequency divided by the pitch upper 
bound value 

[0027] In the step 204, the voice processor 12 is used for gener- 
ating a plurality of autocorrelation values R[t] according to 
a plurality of pointer values between the lag parameter 
lower bound value 

and the lag parameter upper bound value 



.An increment value is set and equal to the lag parameter 
lower bound value 

A, 

,the increment value being equal to the difference be- 
tween two neighboring pointer values. The first pointer 
value is equal to the lag parameter lower bound value 

;the second pointer value is equal to two times the lag pa- 
rameter lower bound value 2 

A, 

and other pointer values is the multiple of the lag param- 
eter lower bound value. The maximum pointer value is the 
lag parameter value 

w« 

[0028] In step 206, the voice processor device 10 compares the 
autocorrelation values to find the maximum of the auto- 
correlation values R[t] and calculates the pitch estimation 
according to the corresponding lag parameter t and the 



equation 4. 



pitch = 




equation 4 

[0029] Please refer to Fig. 3. Fig. 3 is a flowcliart of tlie metliod for 
estimating a pitch estimation in tlie second embodiment 
according to tlie invention. 

[0030] Step 300: determining a pitcli upper bound value 



according to signal x[n] and corresponding pitch ranges in 
a database 18; 

[0031] Step 302: calculating a lag parameter upper bound value 



and a lag parameter lower bound value according to the 
pitch upper bound value 




and a pitch lower bound value 





and the pitch lower bound value determined in step 200; 

[0032] Step 304: using the processor 12 to calculate a plurality of 
autocorrelation values R[t]; 

[0033] Step 306: using the shifting equation in the database 18 
to calculate a threshold value R ^ according to the plurality 
of the autocorrelation values R[t] in the step 304; 

[0034] Step 308: comparing the plurality of autocorrelation val- 
ues R[t] with each other to find the lag parameters corre- 
sponding to the autocorrelation values R[t] that are larger 
than the threshold value R^^; the lag parameters corre- 
sponding to the autocorrelation values R[t] that are larger 
than the threshold value R are the set B; 

th 

[0035] Step 310: calculating the autocorrelation value R[t] corre- 
sponding to the each lag parameter t in the set B; the au- 
tocorrelation values R[t] corresponding to the each lag 
parameter t in the set B are the set C; and 

[0036] Step 312: Calculating the pitch estimation according to 
equation 4 and the lag parameter t corresponding to the 
maximum autocorrelation value R[t] in the set C. 

[0037] In step 300, the voice processor 12 determines a pitch 
upper bound value 



and a pitch lower bound value 

according to the signal x[n] and corresponding pitch 
ranges in a database 18; 
[0038] In step 302, the voice processor 12 calculates a lag pa- 
rameter upper bound value 

and a lag parameter lower bound value 

A, 

according to the pitch upper bound value 
and the pitch lower bound value 

determined in step 200. The lag parameter upper bound 
value 



is the sampling frequency divided by the pitch lower 
bound value 

and the lag parameter lower bound value 

is the sampling frequency divided by the pitch upper 
bound value 

[0039] In step 304, the voice processor 12 is used for generating 
a plurality of autocorrelation values R[t] according to a 
plurality of pointer values between the lag parameter 
lower bound value 

,the lag parameter upper bound value 



and equation 3. 
[0040] In steps 306 and 308, the shifting equation in the 

database 18 is used for calculating a threshold value R^^ 
according to the plurality of the autocorrelation values 
R[t] in the step 304. The plurality of autocorrelation val- 
ues R[t] are compared with each other to find the lag pa- 
rameters corresponding to the autocorrelation values R[t] 
that are larger than the threshold value R^^. The lag pa- 
rameters corresponding to the autocorrelation values R[t] 
that are larger than the threshold value R is set B. An in- 

th 

crement value is set and equal to the lag parameter lower 
bound value 



,the increment value being equal to the difference be- 
tween two neighboring pointer values. The first pointer 
values is equal to the lag parameter lower bound value 



;the second pointer value is equal to two times the lag pa- 
rameter lower bound value 2 






and other pointer values is the multiple of the lag param- 
eter lower bound value. The Maximum of the pointer value 
is the lag parameter value 

w« 

[0041] In steps 310 and 312, each autocorrelation value R[t] cor- 
responding to the each lag parameter t in the set B is cal- 
culated according to the lag parameter t in the set B and 
the equation 3. the autocorrelation values R[t] corre- 
sponding to the each lag parameter t in the set B are set 
C. Then the pitch estimation is calculated according to the 
equation 4 and the lag parameter t corresponding to the 
maximum autocorrelation value R[t] in the set C. 

[0042] Compared to the prior art, the pitch range of the sound 
signal in the invention is determined according to the 
database 18. The lag parameter upper bound value and 
the lag parameter lower bound value are calculated ac- 
cording to the pitch upper bound value and the pitch 
lower bound value. After that, the pointer values between 
lag parameter upper bound value and the lag parameter 
lower bound value are chosen for calculating the pitch es- 
timation. The method for calculating the pitch estimation 



in the invention is different from the method for calculat- 
ing the pitch estimation according to the prior art that 
uses all the parameters to calculate the autocorrelation 
values. The method for calculating the pitch estimation in 
the invention reduces the amount of operations and en- 
sures that the pitch estimation is exactly determined. 
[0043] Those skilled in the art will readily observe that numerous 
modifications and alterations of the method and device 
may be made while retaining the teachings of the inven- 
tion. Accordingly, the above disclosure should be con- 
structed as limited only by the metes and bounds of the 
appended claims. 



